National Repository of Grey Literature: 33 records found, showing 1 - 10
State of the art speech features used during the Parkinson disease diagnosis
Bílý, Ondřej ; Smékal, Zdeněk (referee) ; Mekyska, Jiří (advisor)
This work deals with diagnosing Parkinson's disease by analysing the speech signal. It first describes how the speech signal is produced, then covers speech signal analysis, preprocessing, and subsequent feature extraction. Next it describes Parkinson's disease and how the disease changes the speech signal, followed by the speech features used for diagnosing Parkinson's disease (FCR, VSA, VOT, etc.). A further part deals with the selection and reduction of features using learning algorithms (SVM, ANN, k-NN) and their subsequent evaluation. The last part of the thesis describes a program for computing the features, describes the feature selection, and evaluates all the results.
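As a rough illustration of two of the features named in the abstract above, the sketch below computes the triangular vowel space area (VSA) and the formant centralization ratio (FCR) from the first two formants of the corner vowels /a/, /i/ and /u/. The formulas are the commonly cited definitions and are an assumption here, not necessarily the ones used in the thesis; the function names and the formant values are hypothetical placeholders.

```python
def vowel_space_area(f1a, f2a, f1i, f2i, f1u, f2u):
    """Triangular VSA in Hz^2: area of the triangle spanned by the
    (F1, F2) points of /i/, /a/ and /u/ (shoelace formula)."""
    return 0.5 * abs(
        f1i * (f2a - f2u) + f1a * (f2u - f2i) + f1u * (f2i - f2a)
    )


def formant_centralization_ratio(f1a, f2a, f1i, f2i, f1u, f2u):
    """FCR: formants that rise with vowel centralization in the numerator,
    formants that fall with centralization in the denominator."""
    return (f2u + f2a + f1i + f1u) / (f2i + f1a)


# Made-up formant values in Hz, for illustration only.
formants = dict(f1a=800.0, f2a=1300.0, f1i=300.0, f2i=2300.0,
                f1u=350.0, f2u=800.0)

vsa = vowel_space_area(**formants)
fcr = formant_centralization_ratio(**formants)
print(f"VSA = {vsa:.0f} Hz^2, FCR = {fcr:.3f}")
```

A smaller VSA and a larger FCR both indicate more centralized vowels, which is why such features are of interest for dysarthric speech.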
Emotional State Recognition Based on Speech Signal Analysis
Čermák, Jan ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The thesis focuses on the classification of emotional states in Matlab, using neural networks and a classifier based on a combination of Gaussian density functions. It deals with speech signal processing; prosodic and spectral features and MFCC coefficients were extracted from the signal. The work also evaluates the quality of the individual features, of which the most suitable were chosen in order to classify emotional states correctly. Two different methods were used to identify the emotional states. The first was classification by neural networks with differently selected parameters, and the second was the Gaussian mixture model (GMM). In both methods, a database of emotional utterances was divided into a training group and a test group, and testing was speaker-independent. The work also compares the analysed methods and presents and compares the results. The conclusion proposes the best parameters and the best classifier for recognizing the speaker's emotional state.
Linear prediction and cepstral synthesis of speech signal in the TTS system
Mekyska, Jiří ; Stejskal, Vojtěch (referee) ; Smékal, Zdeněk (advisor)
This work deals with linear prediction and cepstral synthesis of the speech signal in TTS (Text-to-Speech) systems, with the possibility of modelling prosody. The work describes the speech signal on the acoustic and phonetic levels, the principle of speech production, and the ways the speech signal can be represented in the time and frequency domains. Next, the block structure of a TTS system is presented, with a detailed description of each block. The work also describes the modelling of prosody using the three most important suprasegmental features (fundamental frequency, duration and speech intensity). The work concludes with the design and implementation of a universal Czech TTS system based on speech synthesis in the frequency domain. The system is implemented in MATLAB.
Emotional State Recognition and Classification Based on Speech Signal Analysis
Černý, Lukáš ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The diploma thesis focuses on the classification of emotions. It deals with the parameterization of sound files by suprasegmental and segmental methods, with regard to the subsequent use of these methods. The Berlin database, which contains many recordings of emotional speech, is used. Parameterization produces files that are divided into two parts: the first is used for training and the second for testing. The main point of interest is the self-organizing network. The thesis includes a Matlab program that can be used to parameterize any database. After parameterization, the data are classified by the self-organizing network. The resulting hit rates are presented at the end of the thesis.
Database of vocal samples of human emotions
Hlavica, Michal ; Přinosil, Jiří (referee) ; Atassi, Hicham (advisor)
This bachelor's thesis analyses the theory of emotions: how emotions arise, how they are expressed physiologically by the human body, and how these physiological expressions and emotions are reflected in human speech. It then describes the process of speech production and the basic prosodic and acoustic parameters relevant to the research. The theory of database creation is also described, as a sound basis for the database itself. The database is part of this thesis and consists of recordings cut from television programmes and series. Another very important topic is the description of a software tool for the subjective evaluation of databases, which was created as part of this thesis. It was written in C++ using the Builder C++ compiler. A short analysis of example recordings for each emotion is also included. This analysis deals with fundamental frequency, intensity and the first three formants.
Multilingual analysis of human emotional states
Rendek, Tomáš ; Koula, Ivan (referee) ; Atassi, Hicham (advisor)
This work deals with the properties of the speech signal. It begins by introducing the process of speech production. It then covers the prosodic features of speech, which are characteristics related to emotions. It defines emotion itself, as well as the basic features and parameters of human speech. The analysis uses the program Praat. Because the program is not widely known, part of the work is devoted to introducing it and its advantages. The next part of the work presents two enclosed databases containing recordings of particular human emotional states. These databases were created and collected for the Slovak and German languages. However, neither of them contains spontaneous material. Next, the work introduces the concept of neural networks and regards them as a possible means of recognizing emotional characteristics. The initial analysis yields a large number of features, of which only the best twelve were selected on the basis of geometric separability. These features differ for the two sexes as well as for the two nationalities. They are then used to train a neural network. The work concludes by summarizing the results and discussing the success rate of emotional state recognition. It also gives possible reasons for the degradation of classification performance. The thesis includes a CD with all partial and final results, and files with the recordings for the Slovak and German languages.
Application for the calculation of speech features describing hypokinetic dysarthria
Hynšt, Miroslav ; Mekyska, Jiří (referee) ; Kiska, Tomáš (advisor)
This thesis is about the design and implementation of an application for computing speech parameters of people with Parkinson's disease. It begins with a general description of Parkinson's disease and hypokinetic dysarthria and of how they affect speech and speech parameters. The main areas of speech described are phonation, prosody, articulation and speech fluency. The thesis then describes the specific speech parameters that are most significant for diagnosing Parkinson's disease and tracking its progress over time. It also mentions several significant studies that examined the speech of subjects diagnosed with Parkinson's disease and computed speech parameters in order to analyse their speech impairments. Part of the thesis is a description of the implemented standalone application for calculating, exporting and visualizing speech parameters from selected sound recordings.
Automatic / Automated recognition of emotional states based on utterance analysis
Pfeifer, Leon ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The diploma thesis deals with the analysis of human emotional states. The thesis consists of three parts. The first part characterizes the process of speech generation from the phonetic and psychological points of view. The second part covers the methods and related topics (signal preprocessing, voice activity detector). The fundamental frequency was calculated by the central clipping method; the other methods used are formant frequency analysis and determination of the number of thorns and planes. The third part presents the results of measurements performed with the particular methods. Five different emotional states were evaluated: neutral, anger, happiness, sadness and surprise. The results for each method are discussed at the end of this part.
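The central clipping approach to fundamental frequency estimation mentioned in the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions, not the thesis implementation: the clipping ratio of 0.3, the 60 to 400 Hz search range, the use of a plain autocorrelation peak, and the function names `center_clip` and `estimate_f0` are all hypothetical choices made for the example.

```python
import math


def center_clip(frame, ratio=0.3):
    """Zero out samples below a threshold and shift the rest toward zero.
    The threshold is `ratio` times the peak magnitude of the frame."""
    threshold = ratio * max(abs(s) for s in frame)
    clipped = []
    for s in frame:
        if s > threshold:
            clipped.append(s - threshold)
        elif s < -threshold:
            clipped.append(s + threshold)
        else:
            clipped.append(0.0)
    return clipped


def estimate_f0(frame, fs, f_min=60.0, f_max=400.0, ratio=0.3):
    """Estimate F0 in Hz as fs divided by the lag of the highest
    autocorrelation value of the center-clipped frame.
    Returns 0.0 if no positive peak is found in the search range."""
    clipped = center_clip(frame, ratio)
    lag_min = int(fs / f_max)
    lag_max = min(int(fs / f_min), len(clipped) - 1)
    best_lag, best_val = 0, 0.0
    for lag in range(lag_min, lag_max + 1):
        r = sum(clipped[n] * clipped[n + lag]
                for n in range(len(clipped) - lag))
        if r > best_val:
            best_lag, best_val = lag, r
    return fs / best_lag if best_lag else 0.0


# Synthetic test frame: 50 ms of a 100 Hz sine sampled at 8 kHz.
fs = 8000
frame = [math.sin(2 * math.pi * 100 * n / fs) for n in range(400)]
f0 = estimate_f0(frame, fs)
print(f"estimated F0: {f0:.1f} Hz")
```

Clipping removes low-amplitude detail (largely formant structure) before the autocorrelation, so the peak at the pitch period stands out more clearly. Real speech would additionally need framing and a voiced/unvoiced decision, which are left out here.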
Multiplatform gateway for voice communication in real-time
Starzyczny, Radek ; Krkoš, Radko (referee) ; Novotný, Bohumil (advisor)
This master's thesis focuses on VoIP communications. It describes the deployment of the OpenWRT operating system, the analog interface of the Gigaset SX762 router, and a GSM gateway for receiving or placing calls. The thesis describes the protocols involved in the communication and the basic configuration elements. Deploying IP telephony makes it possible to reduce operating costs and provides a number of additional functions.
Modelling Prosodic Dynamics for Speaker Recognition
Jančík, Zdeněk ; Fapšo, Michal (referee) ; Matějka, Pavel (advisor)
Most current automatic speaker recognition systems extract speaker-dependent features by looking at short-term spectral information. This approach ignores long-term information. I explored an approach that uses the fundamental frequency and energy trajectories for each speaker. This approach models prosody dynamics on single phonemes or syllables. It is known from the literature that prosodic systems do not work as well as acoustic ones, but that they improve the overall system when fused. I verified this assumption by fusing my results with the state-of-the-art acoustic system from BUT. Data from standard evaluation campaigns organized by the National Institute of Standards and Technology are used for all experiments.
